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We consider adapting the transmission rate to maximize the goodput, i.e., the amount of data transmitted 
without error, over a continuous Markov flat-fading wireless channel. In particular, we consider schemes in which 
transmitter channel state is inferred from degraded causal error-rate feedback, such as packet-level ACK/NAKs in 
■ an automatic repeat request (ARQ) system. In such schemes, the choice of transmission rate affects not only the 

subsequent goodput but also the subsequent feedback, implying that the optimal rate schedule is given by a partially 
observable Markov decision process (POMDP). Because solution of the POMDP is computationally impractical, 
■^d^ I we consider simple suboptimal greedy rate assignment and show that the optimal scheme would itself be greedy if 

the error-rate feedback was non-degraded. Furthermore, we show that greedy rate assignment using non-degraded 
feedback yields a total goodput that upper bounds that of optimal rate assignment using degraded feedback. We then 
detail the implementation of the greedy scheme and propose a reduced-complexity greedy scheme that adapts the 
transmission rate only once per block of packets. We also investigate the performance of the schemes numerically, 
^ ■ and show that the proposed greedy scheme achieves steady-state goodputs that are reasonably close to the upper 

bound on goodput calculated using non-degraded feedback. A similar improvement is obtained in steady-state 
goodput, drop rate, and average buffer occupancy in the presence of data buffers. We also investigate an upper 
. _ bound on the performance of optimal rate assignment for a discrete approximation of the channel and show that 

QQ ■ such quantization leads to a significant loss in achievable goodput. 

I Keywords: Communication systems, adaptive modulation, ARQ, cross-layer strategies. 
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. I. Introduction 

^ ■ Channel variability is common to all wireless communications due to factors such as fading, mobility, 
O and multiuser interference. One way to combat the detrimental effects of channel variability is through rate 
j> adaptation [1]-[15]. The idea is that, based on the predicted channel state, the transmitter optimizes the 
data rate in an effort to maximize the amount of data communicated without error, i.e., the goodput. For 



example, when the channel quality is below average, the data rate should be decreased to avoid reception 
5^ , errors, while, when the channel quality is above average, the data rate should be increased to prevent the 
channel from being underutilized. 

Rate adaptation would be relatively straightforward if the transmitter could perfectly predict the channel. 
In practice, however, maintaining accurate transmitter channel state information is a nontrivial task that 
can consume valuable resources. In pilot aided rate adaptation (e.g., [l]-[4]), the transmitter broadcasts 
physical-layer pilots that the receiver uses to estimate the channel. The receiver's channel estimates (or 
quantized versions thereof) are then fed back to the transmitter for use in rate adaptation. Because this 
approach consumes bandwidth on both the forward and reverse links, there is a strong motivation to 
consider non-pilot- aided approaches. In this paper, we focus on rate adaptation schemes that infer the 
channel state by monitoring packet acknowledgments/ negative-acknowledgments (ACK/NAKs) [5]-[15], 
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i.e., the feedback information used for automatic repeat request (ARQ). Since ARQ feedback is a standard 
provision of the link layer, its use by the physical layer comes essentially "for free." 

From the viewpoint of rate adaptation, though, ARQ feedback is quite different than pilot-aided 
feedback. While pilot-aided feedback provides an indicator of the absolute channel gain, ARQ provides 
an indicator of the channel gain relative to the chosen data rate; an ACK implies that the channel was 
good enough to support the rate while a NAK implies otherwise. As a result, with ARQ-feedback based 
rate adaptation, the chosen data rate affects not only the subsequent goodput but also the quality of the 
subsequent feedback, which in turn will affect future goodputs through future rate assignments. In fact, 
with ARQ feedback, optimal rate assignment for communication over Markov channels can be recognized 
as a dynamic program [14], in particular, a partially observable Markov decision process (POMDP) [16]. 

In this paper, we consider the general problem of adapting the transmission rate via delayed and 
degraded error-rate feedback in order to maximize long-term expected goodput. In order to circumvent 
the sub-optimality of finite-state channel approximations [17], we assume a Markov channel indexed by a 
continuous parameter. Because the optimal solution of the POMDP is too difficult to obtain, we consider 
the use of greedy rate adaptation. First we establish that the optimal rate assignment is itself greedy when 
the error-rate feedback is not degraded. Furthermore, we establish that the greedy non-degraded scheme 
can be used to upper bound the optimal degraded scheme in terms of long-term goodput. Second, we 
outline a novel implementation of the greedy rate assignment scheme. For the example case of binary (i.e., 
ACK/NAK) degraded error-rate feedback, a Rayleigh-fading channel, and uncoded QAM modulation, we 
show (numerically) that the long-term goodput achieved by our greedy rate assignment scheme is close 
to the upper bound. 

Compared to the previous works [5]-[13], which are ad hoc in nature, we take a more structured 
approach to cross-layer rate adaptation. Compared to the POMDP-based work [14], our work differs in 
the following key aspects: 1) we employ a continuous-state Markov channel model, 2) we consider delay 
in the feedback channel, and 3) we propose simpler greedy heuristics, which we study analytically as well 
as numerically. Though our adaptation objective — goodput maximization — does not explicitly consider the 
input buffer statefl as does the one in [14], we show (numerically) that finite buffer effects (e.g., packet 
delay and drop rate) are handled gracefully by our greedy algorithms. In fact, one could argue that, 
since only successfully communicated packets are removed from the input queue, the maximization of 
short-term goodput — our greedy objective — leads simultaneously to the minimization of buffer occupancy. 

The paper is organized as follows. In Section |Ill we outline our system model, and in Section |llll 
we consider optimal rate adaptation and suboptimal greedy approaches. In Section |IVl we detail a novel 
implementation of the greedy rate-assignment scheme, which we then analyze numerically in Section |V] 
for the case of uncoded QAM transmission, ACK/NAK feedback, and a Rayleigh-fading channel. We 
summarize our findings in Section |VIl 

II. System Model 

We consider a packetized transmission system in which the transmitter receives delayed and degraded 
feedback on the success of previous packet transmissions (e.g., binary ACK/NAKs), which it uses to 
adapt the subsequent transmission parameters. In particular, we assume the use of a transmission scheme 
parameterized by a data rate of rt bits per packet, where t denotes the packet index. For simplicity, we 
assume a fixed transmission power and a fixed packet length of p channel uses. 

Figure [T] shows the system model. The time-varying wireless channel is modeled by an SNR process 
{7i}, where the SNR 7^ ^ is assumed to be constant over the packet duration. Notice that 7^ is not 
assumed to be a discrete parameter. Since the transmission power is fixed, {7^} is an exogenous process 
that does not depend on the transmission parameters. The instantaneous packet error rate e{rt,'jt) varies 
with the rate rt and SNR 7^ according to the particular modulation/demodulation scheme in use. The 

' For algorithm design, we assume an infinitely back-logged queue. 
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instantaneous goodput G{rt,'-ft), defined as tlie number of successfully communicated bits per channel 
use, is then given by 

G{n,^t) = (l-e(ri,7i))ri, (1) 

where e{rt, 7t) denotes the error rate at time t. The transmitter uses q.^^, a degraded version of the (rf ^ 1 
delayed) error rate e{rt-d, 'Jt-d) to choose the time-t rate parameter rf. We will assume that, for each r^, 
the function e(rt,7j) is monotonically decreasing in 7^, so that 7* can be uniquely determined given rt 
and the true error rate e{rt,'jt)- 

Example 1: As an illustrative example, we now consider uncoded quadrature amplitude modulation 
(QAM) using a square constellation of size m, and minimum-distance decision making. At the link layer, 
where symbols are grouped to form packets, a fixed number of extra cyclic redundancy check (CRC) 
bits are appended to each packet for the purpose of error detection. We will assume the probability of 
undetected error is negligible and the associated ACK/NAK error feedback is sent back to the transmitter 
over an error-free reverse channel. We will also assume that the number of CRC bits is small compared 
to the packet size, allowing us to ignore them in goodput calculations. 

Under an AWGN channel, and with 7 describing the ratio of received symbol power to additive noise 
power, the symbol error rate for minimum-distance decision making is [18, p. 280] 




1-2 1-^ Q W -\ , (2) 



where Q{-) denotes the Q-function [18]. If we assume that the constellation size is fixed over the packet 
duration, then the data rate equals rt = p log2 rrit and the packet error rate equals 

Plugging ([3]) into ^ yields the instantaneous goodput expression, which identifies a particular one-to-one 
mapping between rate and goodput for a fixed SNR 7^. Thus, if the SNR was known perfectly, then 
the goodput could be maximized by appropriate choice of constellation size. Figure [2] plots instantaneous 
goodput contours versus SNR 7^ and constellation size rrit for the case of p = 100 symbols per packet. 
Figure [2] also plots the (unique) goodput-maximizing constellation size as a function of SNR. Here the 
finite set of allowed constellation choices (and hence rates) is apparent. Note that, for this uncoded 
communication scheme, the SNR must be relatively high to facilitate rate adaptation; as long as the SNR 
remains below 14 dB, the goodput-maximizing constellation size remains at m = 4 (i.e., QPSK). Coded 
transmission, on the other hand, could facilitate rate adaptation at lower SNRs. 

With ACK/NAK error feedback, the degraded error-rate it is a Bernoulli random variable generated 
from e(rf,7f) according to the conditional probability mass function 



e{rt,-ft)) = {l-e{rt,-ft) k = (4) 




[0 else. 

□ 

While the previous example focuses on a particular modulation/demodulation scheme and a particular 
error feedback model, we emphasize that the principal results in the sequel are general; no particular 
modulation/demodulation scheme and error feedback model are assumed. 
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III. Optimal Rate Adaptation 

In this section we formalize the problem of finite horizon goodput maximization. For convenience 
we assume that process {7t,et,rt} has been initiated at time t = — oo, though we consider only the 
finite sequence of packet indices {0, . . . , T} for goodput maximization. Also, we use the abbreviation 

et = e(rt,7t). 

For every packet index t ^ 0, we assume that the rate controller has access to the (degraded) error-rate 
feedback et-d = [•••, e-2, e-i, eo, • • • , et-d], where 1 denotes the causal feedback delay. Formally, we 
consider et^d to be degraded relative to the true error-rate vector St^d if 

E{G{rt,-ft) I f^t~d,rt^d} ^{G{rt,7t) I et-d,rt-d} (5) 

= E{G(n,7i) I 7t_J, (6) 

where rt_d = [•••, r_2, r-i, tq, . . . , rt_ J . Equation © follows because each SNR 7^ in 7^,^ = [•••, 7-2, l-i, 7o, 
can be uniquely determined from the pair (efc,rfc). 

At packet index t, the optimal controller uses the degraded error-rate sequence et-d (as well as 
knowledge of the previously chosen rates Vt^d) to choose the rate rt from a set TZ of admissible rates in 
order to maximize the total expected goodput for the current and remaining packets: 



= argmax E<^ G'(rj,7j) + ^ G{r^,jk) 
I k=t+i 



et-d,rt-d} for t = 0,...,T. (7) 



The optimal expected sum goodput for packets {t, . . . ,T} can then be written (for t ^ 0) as 



et-d,rt-d}- (8) 



k k=t 

For a unij^ delay system (i.e., d = 1), the following Bellman equation [19] specifies the associated 
finite-horizon dynamic programming problem: 

Gt(et_i,rt„i) = maxf E{G(rt,7t) | et„i,rt_i} 

+ E{G,V([e,_i,eJ,[n_i,ri]) | ei„i,r<„i}}, (9) 

where the second expectation is over q. The solution to this problem is sometimes referred to as a partially 
observable Markov decision process (POMDP) [16]. 

For practical horizons T, optimal rate selection based on ^ is computationally impractical, in part due 
to the continuous-state nature of the channelJl In fact, it is known that POMDPs are PSPACE-complete, 
i.e., they require both complexity and memory that grow exponentially with the horizon T [20]. For 
an intuitive understanding of this phenomenon, notice from ^ that the solution of the rate assignment 
problem at every time t depends on the optimal rate assignments up to time t — 1. But, because both 
terms on the right side of Q are dependent on r^, the solution of the rate assignment problem at time t 
also depends on the solution of the rate assignment problem at time t + 1, which in turn depends on the 
solution of the rate assignment problem at time t + 2, and so on. Consequently, the much simpler greedy 
rate assignment scheme 

rt = argmax E{G(rt,7t) | et-d,rt-d} for t = 0,...,T, (10) 

is suboptimal. 

The question of principal interest is then: What is the loss in goodput with the greedy scheme diOl) 
relative to the optimal scheme ([7|) ? Since it is too difficult to compute the optimal goodput (which depends 

^ For the d> 1 case, the Bellman equation is more complicated, and so we omit it for brevity. 

^ Though a quantized channel approximation could — with few enough states — yield a practical POMDP solution, we show in Section fVl 
that channel quantization leads to significant loss in goodput. 
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on the optimal rate assignment r^), we instead compare the greedy scheme (flOl) to an upper bound on 
the optimal goodput. To establish the upper bound, we show that greedy rate assignment using non- 
degraded error-rate feedback yields a total goodput that is no less than that of optimal rate assignment 
using degraded error-rate feedback. While the latter is difficult to compute, the former is not. 

We now detail the rate assignment scheme that leads to our total-goodput upper bound. At packet index 
t, consider the rate assignment that maximizes the total expected goodput for the current and remaining 
packets using knowledge of the non-degraded feedback e^^^: 



T 



'"'^^ I k=t+i 



et-d, rt-d \ for t = 0, . . . , T. 



(11) 



We refer to this scheme as the causal genie. Note that (fTTI) differs from (|7]) only in that et-d is used in 
place of et_d- Because 7^_^ can be uniquely determined from {et^d^Tt-d)^ the causal genie can also be 
written as 



argmax E<^G(rt,7j)+ V G{ 



It-d 



for t = 0, . . . , T. 



(12) 



Since the choice of {r''^}'k=t+i ^iU not depend on the choice of r^, the optimal expected sum goodput for 
packets {t, . . . , T} can be written (for t ^ 0) as 



Gf{et.d,rt-d) 



E 



ax E|G(r„7*)+ E G^^k.lk 
=^ I k=t+i 

\ E G'(rI^7. 

I k=t+l 



maxE{G(rt,7t) | 



(13) 



(14) 



which shows that optimal rate assignment under non-degraded causal error-rate feedback can be accom- 
plished greedily. In other words. 



argmax E{G(n,7t) | j^^^} 

rt&Tc 

argmax E{G{rt,-ft) \ €t-d,rt-d}- 

rteTc 



(15) 
(16) 



We now establish that the causal genie controller upper bounds the optimal controller with degraded 
error-rate feedback in the sense of total goodput. Though the result may be intuitive, the proof provides 
insight into the relationship between degradation of the feedback and reduction of the total expected 
goodput. 

Lemma 1: Given arbitrary past rates r^d and corresponding degraded error-rate feedback e^d, the 
expected total goodput for optimal rate allocation under degraded feedback is no higher than the expected 
total goodput for the causal-genie rate allocation under non-degraded feedback, i.e.. 



e{EG«,7*) 



e-d,r.d 



Proof: For any t G {0, . . . , T} and any realization of (e^^^, r^^^), we can write 

I et-d,rt-d} 
= max E I E{G'(rt,7t) | et-d,rt^d, et-d} 
^ Ejmax: E{G{rt,'yt) \ et^d,rt^d, ^t^d} 
= Ejmax E{G(rt,7t) | Jt^^} et^d,rt^d 



E{G(r4*,7t) I et-d,rt-d} ^ max E{G'(n,7t) 

rt&l 



et-d,rt-d^ 
et-d,rt-d\ 



] 



E{G(r,^^7^) | et-d^Vt^d}. 



(17) 

(18) 

(19) 

(20) 

(21) 
(22) 



6 



where (fTSi) follows since is chosen to maximize the long term goodput — not the instantaneous goodput; 
(|20l ) follows since max^^ E{/(rt)} ^ Ejmax^^ f{rt)} for any /(■); (|2TI) follows by definition of degraded 
feedback; and (|22l) follows by definition of the greedy genie. Taking the expectation over {et-d,rt-d), 
conditional on (e^dif^d), we find 

E{G(r*,7i) I e^d,r^d} ^ E{G(rf,7j | e^d,r^d}- (23) 

Finally, summing both sides of (l23l) over t = {0, . . . , T} yields (flTl) . ■ 
In Section HV] we study the greedy rate assignment scheme (flOl) in depth. Then, in Section |Vl we study 
(numerically) the particular case in which {et}t^o is constructed from link-layer ACK/NAKs. 

IV. The Greedy Rate Adaptation Algorithm 

In this section, we detail the implementation of greedy rate assignment (flOl) assuming continuous 
Markov SNR variation and conditionally independent error-rate estimates. In Section IIV-A[ we detail a 
procedure for packet-rate adaptation, while in Section IIV-B[ we consider adapting the rate once per block 
of n packets. 

A. Packet-Rate Algorithm 

Assuming a feedback delay of d ^ 1 packets, the greedy rate assignment (fTOl) can be rewritten as 

ft = argmax / G{rt, '^t)p{lt \ ^t~d, rt^d) d^it for t = 0, . . . , T. (24) 

We now derive a recursive implementation of the greedy rate assignment (l24l) . 
Expanding the inferred SNR distribution via Bayes rule, we find 

Vilt I 7i-d, h-d, rt^d)v{lt~d I et_d, rt„d) d'^t-d (25) 

p{lt I lt-d)p{lt-d I h-d, rt-d) d'jt-d, (26) 
where we used the assumption of Markov SNR variation to write (|26l ). Furthermore, 

pi'jt-d I et-d,rt-d) 

= P{lt~d I ^t~d,h~d~i,rt-d) (27) 

(28) 

(29) 



p{h~ 


d 


-ft-d,et-d-i,rt-d)pht-d 1 


^t-d-i, rt-d) 


Ip{^t~d 
p{^t- 


it_d^et-d-urt-d)p{,it-d 1 e 
d 1 e{Tt-d,lt-d),^t-d-i)p{.lt^ 


t-d-i,rt-d) dit_d 

d ^t~d~l, ft-d-l) 


Ip{^t~d 


e 


{rt-d,l't-d)^^^t~d-i)p{lLd 


1 et-d-i,rt-d-i) dit-^ 



With conditionally independent error estimates (i.e., p{et \ et, ^t~i) = p{h \ ^t)), this becomes 

p{-ft~d I et-d,rt-d) 



P{h- 


d 


e{rt-d,7t-d))p{lt- 


d 


^t~d~i, ft-d-i) 


Ip{^t~d 


e{rt-d,l't-d))p{lLd 


e 


t~d~i,rt-d~i) c?7l-d 



(30) 



Similar to (|26l) . we can also write 

p(7t_d+i I et_d,rt-d) 
= J p{lt~d+i I lt~d, h^d, rt-d) 



X p{'^t-d I et-d,rt-d) d-ft~d (31) 
Jp{lt-d-,^\lt-d)p{lt-d\et.d,rt.d)d^t-d. (32) 
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Equations (|26l ). (|30l ). and (|32l ) lead to the following recursive implementation of the greedy rate assignment 
(|2?1) . Assuming the availabilitjQ of p{'-ft-d I rt^^^i) when calculating r^, the rate assignment 

procedure for packet indices t = 0, . . . , T is: 

1) easure et_d, compute p{et-d \ It-d)) as a function of 'yt-d, and then calculate the distribution 
p{,lt~d I h^d, rt^d) using dSO]). 

2) Calculate ^(74 | ej_<i, rt^d) using the Markov prediction step (l26l) . 

3) Calculate ft via (1241) . 

4) l£] (i > 1, then calculate p{%^d+i I ^4^^, rt„d) via (|32l) for use in the next iteration. 



5. Block-Rate Algorithm 

Since it may be impractical for the transmitter to adapt the rate on a per-packet basis, we now propose 
a modification of the algorithm detailed in Section IIV-AI that adapts the rate only once per block of n 
packets. The main idea behind our block-rate algorithm is that the SNR {'jt} and error-rate estimates {it} 
are treated as if they were constant over the block, thereby allowing a straightforward application of the 
method from Section IIV-AI Though this treatment is suboptimal, our intention is to trade performance for 
reduced complexity. 

The details of our block-rate algorithm are now given. Denoting the block index by i, the block versions 
of the degraded error-rate estimate and SNR are defined as 

(i+l)n-l 

= - E (33) 

t=in 

li - 7m+Ln/2j, (34) 

and the assigned rates {rt} axe related to the calculated rates {r^} as 

n = Lit/n\ ■ (35) 

Notice that, when n = 1, the block-rate quantities reduce to the packet-rate quantities, i.e., = 7. = 

and Zi = ^i- 

Borrowing the packet-rate adaptation approach from Section IIV-AI the block-rate greedy implemen- 
tation goes as follows. Here, we use d to denote the delay in blocks. Assuming the availabilit)@ of 
P{li_d I ii-d-i7 Ht-d-i) when calculating r^, the rate assignment procedure for block indices i = 0, . . . , [T/n] 
is: 

1) easure {ejlr(SJ^~\ compute ii_d via ([33]), compute p{ii^d I ^fc-d, 7i„rf)) as a function of 
and then calculate the inferred SNR distribution p(7._^ | ij-d, TLi-d) using 



■i—d 



I SLi-d^Li-d) 

P{ii-d I ^{U-d,li_d))p{li-d 



^i-d-1, Li-d-i) 

(36) 



Ip{ii-d I e(ri-d,7-_J)p(7-_rf I ki-d-i,ri~d~i) di._^ 
2) Calculate ^(7. | e^^d^Hi-d) using the Markov prediction step 

PiX, I ti-d, JLi-d) 

P(l^ I li-d)P(li-d I ^^~d.n_d) d2,_,. (37) 



For the initial packet indices t G {0, . . . , d}, if tiie pdf p{'yt-d \ fit-d-i, f t-d-i) is unknown, then we suggest to use the prior piji-d) 
in its place. 

^ Notice that, if d = 1, then p{jt-d+i \ et-dyft-d) was already computed in step 2). 

* For the initial block indices i G {0, . . . , d}, if the pdf p{'y_ ^ \ ti-d-i'l^i-d-i) unknown, then we suggest to use the prior p{'y_ ^) 



in its place. 
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3) Calculate fj via 



4) l£| (i > 1, then calculate p(7 




(39) 



(38) 



As the adaptation-block size n increases, we expect the packet error rate estimate to become more 
accurate (since it is estimated from, e.g., n ACK/NAKs), the SNR model to get less accurate (since a 
block-fading approximation is being applied to a process that is continuously fading), and the per-packet 
implementation complexity of the algorithm to decrease. 

We note that the block-rate modification proposed here is suboptimal in the sense that the SNR of each 
packet in a block could have been predicted individually, rather than predicting only the SNR of the packet 
in the middle of the block. Likewise, individual rates could have been assigned for each packet in the 
block, rather than a uniform rate for all packets in the block. However, joint optimization of intra-block 
rates appears to be prohibitively complex and thus goes against our primary motivation for the block-rate 
algorithm, i.e., simplicity. 

Finally, we note that a similar block-rate modification can also be applied to the causal genie scheme 
(fT6l) . which has been recognized as a non-degraded-feedback version of the greedy scheme (flOl) . However, 
doing so would spoil the total-goodput optimality of the packet-rate causal genie that was identified in 
Lemma [T] 



We now describe the results of numerical experiments in which we assume uncoded square-QAM 
modulation, a Gauss-Markov fading channel, and minimum variance unbiased (MVU) estimation of the 
error-rate, as detailed below. While other examples of modulation, error-rate estimation, and fading could 
have been employed, we feel that our choices are sufficient to illustrate the essential behaviors of the 
generic rate adaptation schemes discussed in Sections [ni] and |IVl 



For our numerical experiments, we used the uncoded QAM modulation/demodulation scheme described 
in Example 1, which yields the packet error-rate given in ([3]). We used squared-integer constellation sizes, 
i.e., 4-QAM, 9-QAM, 16-QAM, etc. In addition, we used causal degraded error-rate feedback in the form 
of one ACK/NAK per transmitted packet. Thus, in a bloclH of n packets, there were n ACK/NAKs. 

Given this setup, it can be shown that the MVU estimate [21] of the average packet error rate over the 
i-th block can be computed by a simple arithmetic average of the n ACK/NAKs, using for an ACK 
and 1 for a NAK. Notice that this MVU estimate corresponds exactly to the block error-rate estimate 
specified in (l33l) . Furthermore, if denotes the value of the true packet error rate over the i-th block, 
then the number of NAKs per block is Binomial(n, and the error estimate obeys 



Thus, we can calculate = e(£i,7-) as a function of 7. using ^ and plug the results into p(ej | from 
(|40|) in order to compute (l36l) . 



V. Numerical Results 



A. Setup 




for A; = 0, . . . , n 
else. 



(40) 



' Notice that, if d = 1, then p{-y , | ei_j,r_i_j) was already computed in step 2). 
* The results here also hold for packet-rate adaptation through the choice n — 1. 
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To generate the Markov block-rate SNR process {7.}, we first generate a packet-rate complex-valued 
Gauss-Markov "channel gain" [22] process {gt} using 

gt = {1- a)gt-i + awt, (41) 

where {wf} is a zero-mean unit-variance white circular Gaussian driving process and < a < 1. Notice 
that a = 1 corresponds to i.i.d. gains, whereas a = corresponds to a time-invariant gain. We then 
generate a packet-rate SNR process {7*} by scaling the squared magnitude of gt'. 

It = K\gt\\ (42) 

The scaling parameter K in (|42l) is essential because a affects both the (steady-state) coherence time and 
the mean-squared value of the gain {gt}- Thus, by using the two parameters K and a, it is possible to 
independently control the (steady-state) mean and coherence time of the SNR process {7*}. In fact, it can 
be shown that, for steady-state indices t, the SNR 7^ is exponentially distributed with mean value 

To evaluate ^(7. | 7._^), we first notice from (l34l) that p(7j7j„^) = pilt I 1t-nd)- Then, from (HTI) . we 
find that 

nd—l 

gt = {l- af'^gt^nd + a J] (1 - aYwt-,, (43) 

i=o 

where E"=o^(l - ~ CA/'(o, yZ(i^(1 - (1 - af"^))- From this fact, we show in the Appendix 

that 

P{lt I It-nd) = 2ir«(l _ (1 _ «)2nrf) 

/ -(7, + (l-a)2"S-nd)(2-a) \ 
^"^^ \ 2Ka{l - (1 - a)2"'i) J 



'{l-ar^,ffal^di2-ay 
Ka(l — (1 — a) 



B. Results 

Numerical experiments were conducted to investigate the steady-state performance of the greedy al- 
gorithm from Section |IV] relative to three reference schemes: fixed rate, causal genie, and noncausal 
genie, both with and without finite-buffer constraints at the transmitter. The so-called fixed-rate reference 
scheme chooses the fixed rate (i.e., constellation size) that maximizes expected goodput under the prior 
SNR distribution, i.e., argmax^^ / G{rt,'yt)p{lt)d'yf In the absence of feedback, this fixed rate would be 
optimal, i.e., total-goodput maximizing. The causal genie reference scheme defined in Section |lll] adapts 
the rate to maximize expected goodput under perfect causal feedback of the error rate et or, equivalently, 
the SNR 7t. As shown in Section [nil the goodput attained by the causal genie upper bounds that of 
optimal rate selection under degraded feedback. However, as the feedback delay d and/or the block size 
n increases, the causal genie's ability to predict the SNR decreases, and thus its goodput suffers. The 
so-called non-causal genie reference scheme assumes perfect knowledge of SNR 7^ for all past, current, 
and future packets, and uses this information to choose the goodput-maximizing rate. Since this scheme 
has access to more information than the causal-genie and greedy algorithms, it upper bounds them in 
terms of goodput. 
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1) Infinite Buffer Experiments: For the first set of experiments, we assumed an infinitely back-logged 
queue at the transmitter. Unless otherwise noted, the following parameters were used: block size n = 1 
packet, feedback delay d = 1 packet, mean SNR £{7^} = 25 dB, and fading-rate parameter a = 0.001. For 
each channel realization, 200 packets (each consisting of p = 100 symbols) were transmitted. The steady- 
state goodputs reported (per symbol per packet) in the figures were calculated by averaging instantaneous 
goodputs over the packets in 1000 channel realizations for Figs. [3l|4] and 500 channel realizations for 
Figs. [51161 To ensure that steady-state performance was reported, the algorithms were initialized at the 
goodput-maximizing rate for each new channel realization. 

Figure [3l plots steady-state goodput as a function of mean SNR £{7^}. To vary £{7^}, we varied the 
parameter K while keeping a = 0.01. The plot shows the greedy algorithm exhibits an increasing gain 
over the fixed-rate algorithm as mean SNR increases. At low mean SNR, little gain is observed because 
the optimal constellation size is almost always the smallest one, as can be inferred from Fig. [2l But, at 
higher mean SNRs, the greedy algorithm performs about 1 dB worse (in SNR) than the causal genie, 
whereas the fixed-rate scheme performs about 5 dB worse. Furthermore, the SNR gap between the greedy 
and fixed-rate schemes grows as mean SNR increases. Since the steady goodput achieved by the causal 
genie upper bounds that achievable by any causal-feedback-based rate adaptation algorithm, one can infer 
that greedy adaptation based on 1-bit ACK/NAK feedback is sufficient to attain a major fraction of the 
gain achievable by any causal feedback scheme. 

Figure IH shows steady-state goodput versus fading-rate parameter a for mean SNR £{7^} = 25 dB. 
Lower a corresponds to slower channel variation and thus more accurate prediction of instantaneous 
SNR. From the plot, the following can be observed: as a decreases, both the causal genie and the 
greedy algorithm approach the non-causal genie, whereas as a increases, both the causal genie and the 
greedy algorithm approach the fixed-rate algorithm. The non-causal genie and fixed-rate algorithms yield 
essentially constant steady-state goodput versus a. For a wide range of a, it can be seen that the greedy 
algorithm performs closer to the causal genie than it does to the fixed-rate algorithm. Thus, we conclude 
that the greedy scheme captures a dominant fraction (e.g., ^ 90% at low a) of the goodput gain achievable 
under causal feedback. 

Figure [5l plots steady-state goodput versus feedback delay d for packet-rate adaptation, i.e., n = 1. By 
definition, the non-causal genie has access to all past, current, and future SNRs, so its performance is 
unaffected by delay. As for the causal genie and greedy algorithms, their steady-state goodputs measure 
30% and 20% above that of the fixed-rate algorithm, respectively, when d = 1. However, as the delay d 
increases, their causally predicted SNR distributions converge to the prior SNR distribution, so that, the 
causal genie and greedy algorithms eventually perform no better than the fixed-rate algorithm. Still, for 
all delays, the simple greedy scheme captures a dominant fraction of the goodput gain achievable under 
causal feedback. 

Figure [6l plots steady-state goodput versus block size n packets for delay d = 1 packet and a = 0.001. 
For all tested block sizes, the greedy algorithm performs closer to the causal genie than to the fixed-rate 
algorithm, implying that the greedy algorithm once again recovers a dominant portion of the goodput gain 
achievable under the causal feedback constraint. The performances of all adaptive schemes decrease with 
block size, though. This is for two reasons: first, a uniform rate is applied across the block, whereas the 
optimal rate varies across the block; and, second, as the block length increases, the SNR must be predicted 
farther into the future. Notice that even the performance of non-causal genie degrades as n increases due 
to the sub-optimality of its uniform rate assignment across the block. 

Figures [51 and [6l also plot the performance of the so-called quantized genie reference scheme, which 
adapts the rate to maximize goodput under quantized, but otherwise perfect, knowledge of SNR ^t-d- The 
goodput attained by the quantized genie upper bound£^ the goodput attained by a transmitter that assumes 
a finite-state Markov SNR model and employs optimal POMDP-based rate assignment. To construct the 

' Deviations from constant are due to finite averaging effects. 

Tfie fact that the quantized genie yields an upper bound in the case of a finite-state Markov channel follows directly from Lemma [T] 
which holds for both continuous and finite-state Markov channels. 
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corresponding finite-state Markov model, we quantized the SNR using the Lloyd-Max algorithm [23] and 
calculated the state transition probability matrix via [24, eq. (15)-(16)]. Apart from the finite-state SNR 
model, rate assignment for the quantized genie is identical to that for the causal genie. 

Figures \5\ and |6] show that the greedy algorithm outperforms the 2- and 4-state quantized genies, and 
performs on par with the 7-state quantized genie, throughout most of the examined range of d and n. Thus, 
we conclude that the greedy algorithm outperforms the optimal POMDP-based rate adaptation scheme 
based on a finite-state Markov SNR model with 7 states or less. This is notable because the computational 
complexity of optimal POMDP-based rate adaptation is significant, under typical horizons, for channel 
models with more than a few states. 

2) Finite Buffer Experiments: For this second set of experiments, a finite data buffer was employed 
at the transmitter. Bits are removed from the buffer when an ACK arrives, confirming their successful 
transmission, or when the buffer overflows. The following parameters were used: block size n = 1 packet, 
feedback delay d = 1 packet, and mean SNR £{74} = 25 dB. The packet arrival rate followed a 2-state 
Markov model with ON and OFF states. In the ON state, a single packet arrives in the buffer (queue), 
and in the OFF state, no packets arrive. The self transition probability in both ON and OFF states was 
set to 0.9 in order to mimic bursty traffic. Consequently, the steady-state probability of each state is 0.5 
and the long-term arrival rate is 0.5 packets/interval. The size of an arriving packet was set equal to 
the number of bits transmitted (per packet interval) by the fixed-rate reference scheme under backlogged 
conditions. The size of the buffer was set equal to 30 such packets of data. Thus, if packets were arriving 
persistently, then, in the absence of NAKs, the fixed-rate scheme would yield a fixed buffer occupancy, 
while, in the absence of ACKs, the buffer would go from totally empty to totally full after 30 arrivals. 
For each channel realization, 1000 packets were transmitted (each consisting of p = 100 symbols) and the 
buffer was initialized at half-full. The values reported in the figures represent the average of all packets 
in 1000 channel realizations. 

Figure |7] plots average buffer occupancy versus fading-rate parameter a, where a buffer occupancy of 
"6" is to be interpreted as b arrival-packets worth of bits. It can be seen that the buffer occupancy achieved 
by the greedy algorithm is very close to that achieved by the causal and non-causal genie algorithms, 
whereas the buffer occupancy achieved by the fixed-rate scheme is much higher, especially at lower values 
of a. Recall that, when a is low, the SNR can remain below average for prolonged periods of time, during 
which fixed-rate transmissions are more likely to yield NAKs and hence fill the buffer. Figure [8] plots a 
related statistic: the fraction of packets that are dropped due to buffer overflows. Here again, the drop 
rate achieved by the greedy algorithm is very close to that achieved by the causal and non-causal genie 
algorithms, whereas the drop rate achieved by the fixed-rate algorithm is more than 10 times higher. 

Figure |9] shows steady-state goodput versus fading-rate parameter a for Markov arrivals and finite 
buffer size. The steady-state goodput achieved by the greedy scheme is very close to that of the causal 
and non-causal genie schemes, whereas the steady-state goodput achieved by the fixed-rate scheme is 
much lower, especially when a is small. The increase of steady-state goodput with a is directly related to 
the decrease in drop rate with a observed in Fig. [H since dropped packets do not contribute to goodput. 

VI. Conclusion 

In this paper, we studied rate adaptation schemes that use degraded error-rate feedback (e.g., packet-rate 
ACK/NAKs) to maximize finite-horizon expected goodput over continuous Markov flat-fading wireless 
channels. First, we specified the POMDP that leads to the optimal rate schedule and showed that its 
solution is computationally impractical. Then, we proposed a simple greedy alternative and showed that, 
while generally suboptimal, the greedy approach is optimal when the error-rate feedback is non-degraded. 
We then detailed an implementation of the greedy rate-adaptation scheme in which the SNR distribution 
is estimated online (from degraded error-rate feedback) and combined with offline-calculated goodput- 
versus-SNR curves to find the expected-goodput maximizing transmission rate. In addition to the packet- 
rate greedy adaptation scheme, a block-rate greedy adaptation scheme was also proposed that offers the 
potential for significant reduction in complexity with only moderate sacrifice in performance. 
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For the particular case of uncoded square-QAM transmission, packet-rate ACK/NAK feedback, and 
Rayleigh fading, the greedy scheme was numerically compared to three reference schemes: the optimal 
fixed-rate scheme, a genie-aided scheme with perfect causal SNR knowledge, and a genie-aided scheme 
with perfect non-causal SNR knowledge. First, the effects of mean SNR, channel fading rate, and feedback 
delay on steady-state goodput were investigated in the context of an infinitely backlogged transmission 
queue. In this case, the causal genie reference is especially meaningful because it upper bounds the 
performance of the optimal POMDP scheme, which is too complex to implement directly. Second, a 
finite transmission buffer was considered, and the effects of channel fading rate on buffer occupancy, drop 
rate, and steady-state goodput were investigated. The results suggest that the simple packet-rate greedy 
scheme captures a dominant fraction of the achievable goodput under causal feedback, whereas the optimal 
fixed-rate scheme captures significantly less. Similarly, the drop rate and average buffer occupancy of the 
greedy scheme were nearly equal to those of the causal and non-causal genie-aided schemes, whereas 
the drop rate and average buffer occupancy of the fixed-rate scheme were much higher (e.g., an order- 
of- magnitude higher in the case of drop rate). Comparisons to a "quantized genie" scheme that upper 
bounds optimal adaptation under a finite- state Markov SNR model were also made, and there it was 
found that the proposed greedy scheme outperformed the quantized genie scheme with up to 7 states. 
Since POMDP-based optimal rate- adaptation for discrete-Markov channels with 7 or more states would 
be computationally intensive, greedy rate-adaptation based on a continous-Markov channel model is more 
appealing. 



13 



References 

[1] A. Goldsmith, Wirless Communications. New York: Cambridge University Press, 2005. 

[2] A. Goldsmith and S. Chua, "Variable rate variable power M-QAM for fading channels," IEEE Trans. Commun., vol. 45, pp. 1218-1230, 
Oct. 1997. 

[3] D. L. Goeckel, "Adaptive coding for time- varying channels using outdated fading estimates," IEEE Trans. Commun., vol. 47, pp. 844- 
855, June 1999. 

[4] K. Balachandran, S. R. Kadaba, and S. Nanda, "Channel quality estimation and rate adaptation for cellular mobile radio," IEEE J. 

Select. Areas In Commun., vol. 17, pp. 1244-1256, July 1999. 
[5] G. Holland, N. Vaidya, and R Bahl, "A rate adaptive MAC protocol for multi-hop wireless networks," in Proc. ACM Internat. Conf. 

on Mobile Computing and Networking, vol. 5, pp. 3246-3250, 2001. 
[6] B. Sadegi, V. Kanodia, A. Sabharwal, and E. Knightly, "Opportunistic media access for multirate ad hoc networks," in Proc. ACM 

Internat. Conf. on Mobile Computing and Networking, vol. 5, (Atlanta, GA), pp. 3246-3250, 2001. 
[7] J. C. Bicket, "Bit-rate selection in wireless networks," Master's thesis, Massachusetts Institute of Technology, Feb 2005. 
[8] S. Wong, H. Yang, S. Lu, and V. Bharghavan, "Robust rate adaptation for 802.11 wireless networks," in Proc. ACM Internat. Conf. on 

Mobile Computing and Networking, 2006. 
[9] M. Rice and S. B. Wicker, "Adaptive error control for slowly varying channels," IEEE Trans. Commun., vol. 42, pp. 917-925, 

Feb./Mar./Apr. 1994. 

[10] Y.-D. Yao, "An effective go-back-N ARQ scheme for variable-error-rate channels," IEEE Trans. Commun., vol. 43, pp. 20-23, Jan. 
1995. 

[11] S. S. Chakraborty, M. Liinabarja, and E. Yli-Juuti, "An adaptive ARQ scheme with packet combining for time varying channels," IEEE 

Commun. Letters, vol. 3, pp. 52-54, Feb. 1999. 
[12] S. Choi and K. G. Shin, "A class of hybrid ARQ schemes for wireless links," IEEE Trans. Veh. Tech., vol. 50, pp. 777-790, May 2001. 
[13] H. Minn, M. Zeng, and V. K. Bhargava, "On ARQ scheme with adaptive error control," IEEE Trans. Veh. Tech., vol. 50, pp. 1426-1436, 

Nov. 2001. 

[14] A. K. Karmokar, D. V. Djonin, and V. K. Bhargava, "POMDP-based coding rate adaptation for Type-I hybrid ARQ systems over fading 

channels with memory," IEEE Trans. Wireless Commun., vol. 5, pp. 3512-3523, Dec. 2006. 
[15] D. V. Djonin, A. K. Karmokar, and V. K. Bhargava, "Joint rate and power adaptation for type-I hybrid ARQ systems over correlated 

fading channels under different buffer-cost constraints," IEEE Trans. Veh. Tech., vol. 57, pp. 421-435, Jan. 2008. 
[16] G. E. Monahan, "A survey of partially observable Markov decision processes: Theory, models, and algorithms," Management Science, 

vol. 28, pp. 1-16, Jan. 1982. 

[17] C. C. Tan and N. C. Beaulieu, "On first-order Markov modeling for the Rayleigh fading channel," IEEE Trans. Commun., vol. 48, 

pp. 2032-2040, Dec. 2000. 
[18] J. G. Proakis, Digital Communications. New York: McGraw-Hill, 3rd ed., 1995. 
[19] D. Bertsekas, Dynamic Programming and Optimal Control. Athena Scientific, 2nd ed., 2000. 

[20] C. H. Papadimitriou and J. N. Tsitsiklis, "The complexity of Markov decision processes," Mathematics of Operations Research, vol. 12, 
no. 3, pp. 441^50, 1987. 

[21] H. V. Poor, An Introduction to Signal Detection and Estimation. New York: Springer, 2nd ed., 1994. 

[22] I. Abou-Fayal, M. Medard, and U. Madhow, "Binary adaptive coded pilot symbol assisted modulation over rayleigh fading channels 

without feedback," IEEE Trans. Commun., vol. 53, pp. 1036-1046, June 2008. 
[23] A. Gersho and R. M. Gray, Vector Quantization and Signal Compression. Boston: Kluwer Academic Publishers, 1992. 
[24] P. Sadeghi, R. Kennedy, P. Rapajic, and R. Shams, "Finite-state markov modeling of fading channels - a survey of principles and 

applications," IEEE Signal Processing Mag., vol. 25, pp. 57-80, Sept. 2008. 

Appendix 

Here, we derive the expression for ^(7^ | 'jt-nd) given in (l44l) . Let gt,R and gu be the real and imaginary 
parts of channel gain, gt. Also let gt~nd = \gt~nd\e-'^ for 9 ~ U(0,27r). Then 

p{lt\lt-nd)= p{lt\lt-nd,9)p{9)d9. (45) 

JO 

We first find p{\gt\ \ lt~nd,9) in order to evaluate p{\gt\ \ it^nd)- Since 

gt = {l- aY''\gt-ndW' + Z (46) 

fox Z = a Z]r=o and l^f^l = y^, then, conditional on the pair {jt-nd, 9), the random variables 

gt^R and gtj are both Gaussian with mean 

^{gt,R I It-nd, 9} = (1 

and 



E{gt,i I -ft-nd,9} = (1 



a] 



nd 1 1t~nd 

K 



cos 9 



\nd rft—nd . n 



(47) 



(48) 
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respectively, and variance cr| = E{Z^}. Thus conditional on {'jt-nd,(^), the random variable \gt\ 
ah + gli is Rician [1, p. 78]: 



P{\9t\ I lt~nd,0) = ^exp 



+ (1 



X In 



2a| 

nd I It-nd 
K 



Or, 



(49) 



One can see that, given 'yt-nd, the random variable \gt\ is independent of 6. Since 7t = K\gt\'^, we have 

1 



pint I 7i-nd) 



f + (1 - af""^^ 



24 



-'0 



Hence combining (l45l) and (l50l) . we get 

p(7t I 7t-nd) = 



1 /-(7t + (l-«)'"S-nd 

exp 



2AV| 



-'0 77-0 



-nd 



Finally, plugging ct| = 2z^(l - (1 - a)^"'^) into dlB yields (|44l). 

It 



controller 



rt 
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system 



^t-d 



feedback 
channel 



(50) 



(51) 



Fig. 1. System model. 
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Fig. 2. Goodput contours versus SNR 7t and constellation size mt for packet size p = 100. The goodput maximizing constellation size, 
as a function of SNR, is shown by the dash-dot line. 




Fig. 3. Steady-state goodput versus mean SNR E{7t} for a — 0.01, block size n 



— 1 packet, and delay d — 1 packet. 
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Fig. 4. Steady-state goodput versus a for £{74} — 25 dB, block size n — 1 packet, and delay d — 1 packet. 




Fig. 5. Steady-state goodput versus delay d for £{74} — 25 dB, a — 0.001, and block size n = 1 packet. 
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Fig. 7. 

E{7t} = 



Average buffer occupancy versus a for Markov arrivals with average rate 
25 dB, block size n — 1 packet, and delay d — 1 packet. 



= 0.5 packets/interval, buffer size — 30 packets, 
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Fig. 9. 

E{7t} = 



Steady-state goodput rate versus a for Markov arrivals with average rate 
25 dB, block size n = 1 packet, and delay d= 1 packet. 



= 0.5 packets/interval, buffer size = 



30 packets, 



